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Abstract 

Let q be a power of a prime p. We prove an assertion of Carlitz which 
takes q as a parameter. Diaz- Vargas' proof of the Riemann Hypothesis 
for the Goss zeta function for V p [T] depends on his verification of Carlitz 's 
assertion for the specific case q = p [D-V]. Our proof of the general case 
allows us to extend Diaz- Vargas' proof to F ? [T]. 

In [Gosl] Goss presents zeta functions for function fields of finite character- 
istic which possess many interesting properties including analogs of properties of 
the Riemann zeta function. In this paper we prove an analog of the Riemann 
Hypothesis for the Goss zeta function for ¥ q [T] where q is a power of a prime p. 
The statement of the analog appears in Theorem 1.1 below. For the case q = p 
the analog was proven by D. Wan [Wan]. Our proof follows another proof for the 
case q = p recently put forward by Diaz- Vargas [D-V]. A key part of Diaz- Vargas' 
proof involves a result of Carlitz concerning the vanishing of certain power sums 
[Car3]. The proof that Carlitz sketched for his result depends on a combinator- 
ial assertion which takes q as a parameter. Carlitz gives no justification for this 
assertion. A restatement of Carlitz's assertion appears as our Theorem 1.2. Diaz- 
Vargas proved the assertion for q = p. This enabled him to construct his elegant 
proof of the Riemann Hypothesis for F P [T]. Poonen proved Carlitz's assertion for 
the case q = 4 [Po]. Our main result is a proof of Carlitz's assertion for all q. 



This proof was inspired by [Po] and uses some of the same ideas. Carlitz uses his 
assertion in the proofs of two results. As they are now fully justified, we restate 
them in Theorem 1.4. 

In Section 1 we define the Goss zeta function for ¥ q [T] and then state Theorem 
1.1: the analog of the Riemann Hypothesis. Next we state Carlitz's result and 
Theorem 1.2: the combinatorial assertion on which the result depends. A brief 
outline of the proof of Theorem 1.2 is given at the end of Section 1. In Section 2 
we prove Theorem 1.1. Sections 3 through 7 are devoted to the proof of Theorem 
1.2. 

The author (a combinatorialist) wishes to thank Dinesh Thakur and David 
Goss for their help with the number theory. The author thanks Dinesh Thakur 
for bringing Carlitz's assertion to his attention. Special thanks go to the referee 
who made valuable stylistic suggestions and also found many mistakes in earlier 
versions of this paper. 

1. Main Results 

In this paper N denotes the set of nonnegative integers and we set Z + := N\{0}. 
Let p be prime and set q := p s . Let v denote the T _1 -adic valuation on K : = 
¥ q (T). Then the field of Laurent series := ¥ q ((T~ 1 )) is the completion of K 
with respect to v. Denote by A + the set of monic polynomials in A := F ? [T]. Let 
Z p denote the p -adic integers and let be the completion of an algebraic closure 
of Koq. The analogy with characteristic zero is given by A <H> Z, A + <H> Z + , 
K <H> Q, A"oo ^ IR, and H C. The Goss zeta function for ¥ q [T] is defined as 

C(z) := ]T „-* 

neA+ 

where z is taken from 0* X Z p . Exponentiation is defined as follows: for a monic 
polynomial n set (n) := nT~ desn ; then for z = (x,y) £ 0* X Z p Goss defines 

n z ._ x de g n _ 

The term (n) y is well defined since (n) = l(modT -1 ). One draws an analogy be- 
tween this definition and complex exponentiation when C is regarded canonically 
as 1 X 1: for positive integers n we have n^ x,y ^ = (e :E ) log "'(e 8log "') !/ . Goss showed 
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that by grouping together terms of the same degree ( becomes well defined over 
all 0* x Z„: 



C(^):=C(^,y):=]>>~ m E 

m>0 \ n £A+ , degra=m 



From the definition of exponentiation we have nS ' > = n for any integer k. 
Define ((k) := ((T k ,k) for k £ Z. Thus when A; > we have £(£;) = XL e ^+ 
The values of ( on the set of "integers" {(T k , £;) : A; £ Z} are analogous to special 
values of the Riemann zeta function: 

• Carlitz showed the following analog to a theorem of Euler. For k > we 

have 

(((t _ 1)(t) = W!^ 

9{q-l)k 

where tt £ is an analog to 2iri } the B( q -\)k £ A" are analogs to the even 
Bernoulli numbers, and the g[ q -\)k £ ^4 are analogs to factorials [Carl][Car2]. 

• Goss showed (in general, not just for the ¥ q [T] case) that for k > we have 
((—k) £ A and ((—k) = when k = (modg — 1) [Gosl]. In analogy, the 
Riemann zeta function is rational on the negative integers and zero on the 
negative evens. 

Consult [Gos2] for more interesting properties of Goss' zeta functions including 
their connection with cyclotomic extensions and with Drinfeld modules. 

In Section 2 we prove the following analog to the Riemann Hypothesis. It 
states that for fixed y the zeros of ((x } —y) are simple and all lie on the same 
"real line". For a complete explanation of the analog see [Gos2]. 

Theorem 1.1. Fix y £ Z p . As a function of x, the zeros of ((x } —y) are simple 
and lie in K^. In fact they lie in the suhheld F P ((T -1 )). 

In [Car3] Carlitz investigated, among other things, the vanishing of the power 
sums 

S' k (N) := nN 

n£A+ , degn=k 
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for positive integers N. He stated that S' k (N) ^ if and only if there exists a 
(£;+l)-tuple (r , ri, . . . , r^) G whose terms sum to N and satisfy the following 

two conditions: 



(i) there is no carryover of p-adic digits in the sum N 

(ii) Vj > and (p s — 1) | rj for < j < k — 1. 



(Note that (ii) puts no condition on r k .) Let £4+i(iV) be the collection of all such 
(k + l)-tuples. Then Carlitz claimed that S' k (N) ^ if and only if Uk+i(N) ^ 0. 
His proof went as follows. For a monic n G A + set n = a + aiT 1 + - • ■-\-ak-iT k ~ 1 + 
T k . Then 

s' k (N) = J2( N )E a s o --- a *-i Tri+2r2+ "" + * rfc 

^ V r o, ■ ■ - ,r k J ^ 

where the outer sum is over all {k + l)-tuples r = (r , . . . , r k ) such that ^2rj = N 
and the inner sum is over all (a , . . . , (ik-i) G ( IF g ) ^ . Note that the sum ^ aeF a h 
equals —1 when h is a positive multiple of q — 1, and equals otherwise. Thus 
(1.1) becomes 

s'(n) = y( N V-i) fc r ri+2r2+ - +fcr '= (i.2) 

^ \r , ...,r k J 

where the sum is over all (k + l)-tuples (r , r i} . . . , r^) such that Yl r j = N an d 
also satisfy condition (ii) above. A well known result of Lucas states that the 
multinomial coefficient 

JV \ M 



v r ,...,?V r \---r k \ 

is not equivalent to (mod p) if and only if there is no carryover of p-adic digits in 
the sum r r So we can take the sum in (1.2) over Uk+i(N). Thus if Uk+i(N) = 
then S' k (N) = 0. To obtain the converse Carlitz asserted without proof that 

the degree r\ + 2r 2 + • • • + kr k of a monomial appearing 

in (1.2) attains its unique maximum when [r k , r^-i, . . . , r ) (1-3) 

is lexicographically largest among all elements of Uk+i(N). 

We now present an equivalent but slightly different version of (1.3 ). The 
changes make our later notation and definitions a bit easier to digest. For an 
ra-tuple X = (X\, • • • , X m ) G N m we define the weight of X as 

wt{X) := X 1 + 2X 2 + ••• + mX m . (1.4) 



4 



Given a finite subset W C N m , a tuple O £ W is said to be optimal in 14 7 
if wt(0) > u;t(X) for all X £ W\ The greedy element of W is that tuple 
(Gi, . . . , Cr m ) £ W for which (G m , G m -i, . . . , G\) is largest lexicographically. 

A composition of iV £ Z + is a tuple X = (X\, • • • , X m ) of positive integers that 
sum to N. We say that the ra-tuple X is a valid composition of N if, in addition, 
there is no carryover of p-adic digits in the sum N = ^2X 17 and (p s — 1) | Xj for 
1 < j < m — 1. 

Define K„(iV) to be the set of all valid compositions of N of length m. Note 
that 



Most of this paper is devoted to proving the following theorem. 

Theorem 1.2. IfV m (N) is not empty then it contains a unique optimal element. 
Further, the optimal element is the greedy element ofV m (N). 

Theorem 1.2 implies its counterpart for U m (N) which we state in Lemma 1.3 
below. The proof of Lemma 1.3 uses Proposition 4.6. This proposition states that 
if (p s — 1) divides N, and G is the greedy element of V m (N), then 



Lemma 1.3. If U m (N) is not empty then it contains a unique optimal element. 
Further, the optimal element is the greedy element of U m (N). 

Proof. First assume that V m (N) is not empty. Since V m (N) C U m (N) the 
lemma follows from Theorem 1.2 once we show that V m (N) contains the greedy 
and all optimal elements of U m (N). Note that if (X\, . . . , X m ) £ U m (N), then 
X m = N (modp s — 1) by definition. Thus, by (1.5), if N is not divisible by 
(p s - 1) then U m (N) = V m (N). So suppose (p s - 1) divides N. Since V m (N) is 
not empty its greedy element coincides with that of U m (N). By (1.5), all tuples 
in U m (N)\V m (N) have the form (X\, . . . ,X m _i,0) and have the same weight as 
the corresponding elements (X\, . . . ,X m _i) in V m -\(N). Now (1.6) implies that 
the weights of these tuples are strictly less than the weight of the greedy element 
in V m (N). Thus if V m (N) is not empty then it contains all the optimal elements 
of U m (N). If on the other hand V m (N) is empty then U m (N) is essentially equal 
to V m -i(N). The result follows. ■ 



V m (N) = {(X u ■ ■ ■ , X m ) £ U m (N) :X m >0} 
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By comparing (1.2) with the definition of weight (1.4) one see that the degree of 
a monomial appearing in S' k (N) has the form wt(R) — N where R = (r , . . . , r k ) G 
Uk+i(N). Thus Lemma 1.3 is precisely Carlitz's assertion (1.3). In [Car3] Carlitz 
actually uses (1.3) in two arguments: the one given above and another concerning 
the vanishing of the sums 

S k (N) = ]T n N . 

The next theorem includes a restatement of these results which are now justified 
by Theorem 1.2. The proof of Part (a) was given above. The proof of Part (b) is 
similar, see [Car3]. Part (c) follows from Carlitz's assertion (1.3) as was observed 
in [Tha]. 

Theorem 1.4. (a) S' k (N) ^ if and only if U k+1 (N) ^ 0. 

(b) S k (N) ^ if and only ifV k (N) ^ and p s - 1 divides N. 

(c) If S' k (N) ^ then its degree is wt(G) — N where G is the greedy element 
ofU k+1 (N). 

We now outline Sections 3-7 which make up the proof of Theorem 1.2. In 
Section 3 we set up our notation, present most of our definitions, and prove some 
basic results. In Section 4 we give a few more definitions and list several technical 
results. We end Section 4 with a proof of Theorem 1.2 for the case 5 = 1 (i.e., 
q = p). Sections 5, 6, and 7 constitute the proof for the case s > 2. Form the 
set ^ of all pairs (m, iV) such that V m (N) contains an optimal element which is 
not the greedy element. Then Theorem 1.2 is equivalent to the statement "^P is 
the empty set." For each positive integer N let £(N) be the sum of the p-digits 
of N. We assume that ^ is not empty and choose (m, N) G ^ so that (m,£(N)) 
is lexicographically minimal. Under this assumption and using our chosen m and 
N } we show that there exists an optimal composition in V m (N) whose last part 
m is much smaller than the last part G m of the greedy composition in V m (N). 
The existence of O is established in Section 5. In Section 6 we use G m and the 
components of to carefully construct a third composition Z G V m (N). Then we 
show in Section 7 that wt(Z) is strictly larger than wt(0). This contradicts the 
fact that O is optimal and so we conclude that ^ is indeed empty. 



6 



2. Proof of Theorem 1.1 



Let p be prime and set q = p s . In this section we follow [D-V] and use Theorem 
1.2 to prove Theorem 1.1: for fixed y £ 7L P the zeros of ((x } — y), as a function of 
x , are simple and lie m¥ p ((T 1 )). 

Proof, (of Theorem 1.1) For (x,y) £ 0* X 7L P we have 

C{x,-y):=Y J ^ m [ £ (^n) 

m>0 \rx£A+, degra=m 

Fix y £ 7L P and view ((x } —y) as a power series in the variable x~ l . If y £ 
N\{0}, then T~ my S' m (y) is the coefficient of x~ m . Further, (1.2) implies these 
coefficients are in ¥ P ((T~ 1 )). Since N is dense in Z p , the coefficients of ((x } —y) 
are in ¥ P ((T~ 1 )) for any y. 

Define v m (y) to be the valuation of the coefficient of x~ m in ((x } —y). The 
Newton polygon for ((x, —y) is the lower convex hull in IR 2 of the points 

{(m,v m (y)) : m > 0} . 

Its sides describe the valuations of the zeros of ((x } — y): if the Newton polygon 
for ((x } —y) has a side of slope A whose projection onto the horizontal axis has 
length /, then ((x } —y) has precisely / zeros with valuation A. Here, zeros are 
counted with multiplicity. When u m _i(y) and v m (y) are both finite define 

X y (m) := v m (y) - u m _i(y). 

Then when defined, \ y (rn) is the slope of the line segment 

from (m - 1, u m _i(y)) to (m, v m (y)). (2.1) 

In Case I below we show that if ((x } —y) is a polynomial of degree d then X y is 
both defined and strictly increasing on the set {1,2, . . . , off; in Case II we show 
that if ((x } —y) is not a polynomial then X y is defined and strictly increasing on 
all of Z + . In each case we will have shown that these line segments (2.1) are the 
sides of the Newton polygon for ((x } —y). Since the line segments have horizontal 
length one, we will have shown that each root of ((x } —y) is simple and lies in 
^((T- 1 )). 
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Case I: Assume y is a positive integer. Then T~ my S' m (y) is the coefficient of 
x~ m in ((x,—y). Since U m+ i(y) is empty for large enough m, Theorem 1.4(a) 
implies that ((x,—y) is a polynomial of x~ l . Let d be the degree of ((x,—y). 
Then Uk(y) is empty for k > d + 1. Note that ifl < £; < o? + 1 then Uk(y) ^ 0: 
clearly Ud+i(y) ^ 0; if (^i, • • • , A^+i) G Ud+i(N) then for any 1 < k < d we have 
(X 1 ,...,X k + h X d+1 ) G U k (N). By Theorem 1.4(a) we have 

v m (y) = { ™y-*<*S' m (y) |fO<m<d _ ^ 
[ oo n m > d 

Thus X y is defined on {1, . . . , d}. Further, for 2 < m < d we have 

X y (m) - Xy(m - 1) = (u m (y) - u m _i(y)) - (u m _i(y) - u m _ 2 (y)) (2.3) 
= [degS^.^y) - degS^(y)] - [degS^_ 2 (y) - degS^.^y)] 
= [deg S'^ (y) - deg S' m _ 2 (y)] - [deg S' m (y) - deg S'^ (y)] . 

Let F = (F u . . . , F m _i), G=(G U ..., G m ), and # = . . . , # m+1 ) be the 
greedy elements from U m -i(y), U m (y) and U m+ i(y). By Theorem 1.4(c) we have 

A(m) - A(m - 1) = [iwt(G) - wt(F)] - [wt(H) - wt(G)]. (2.4) 

Combine the last two components of H to form Z := (Hi, . . . , i7 m _i, i7 m + _f/ m+ i). 
Then Z is an element of U m (y) and wt(Z) = wt(H) — H m+ i. By Lemma 1.3 we 
have wt(Z) < wt(G). Thus from (2.4) we get 

A(m)-A(m-1) > [u;t(Z) — ^t(F)] — [wt(H) — wt(Z)\ 

= [wt(Z)-wt(F)]-H m+ i. (2.5) 

Drop the last component of H to form Y := (Hi, . . . , i7 m _i, i/ m ). Since i7 is the 
greedy element of U m+ i(y), the composition is the greedy element of U m (y — 
H m+ i) (Proposition 4.1(a)). Also, we have 

wt(Y) = wt(Z) - mH m+1 . (2.6) 

Note that Y is also the greedy element in V m (y — H m+ i). Since (q — 1) \y — H m+ i, 
Proposition 4.6 implies 

<wt (Y)-(m-l)(y- H m+ i). (2.7) 
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Add H m+ i to each side of (2.7) and then combine with (2.6) to get 

H m+1 < wt(Y) + mH m+1 - (ra - l)y 

= wt(Z) - (m - l)y. (2.8) 

Now from (2.5), first apply the fact that the weight of F = . . . , F m _i) is less 
than or equal to (m — l)y and then apply (2.8) to get 

\ y {m) - \ y {m - I) > [wt(Z) - wt(F)] - H m+1 

> wt(Z) - (ra - l)y - H m+1 

> 0. 

This completes Case I. 

Case II: Assume that y £ 7L, p — N so that y = y + yip + • • • + yiP 1 + • • • has an 
infinite number of nonzero digits. We need the following easy lemma. 

Lemma 2.1. Let yit) he the sum of the first t terms in the p -adic expansion of 

y- 

t 

2/W := ^2y*p l - 

8 = 

Then for any fixed m there exists a positive integer t' such that U m (y(t)) is 
nonempty for all t > t' . 

Proof. Fix m. We construct t' and an element X = (Xi } . . . } X m ) of U m (y(t)) 
for any t > t' . For < h < s — 1 set 

oo 

\ ^ is+h 

Zh := z^y^+hp ■ 

8 = 

Then y = z -\- ■ ■ ■ + z s _i. There exists an h! such that has an infinite number of 
nonzero p-adic digits. Form the nondecreasing infinite sequence cr(zh>) consisting 
of yi s +h' copies of p ts+h ' for each i > 0. Set X\ equal to the sum of the first p s — 1 
terms of cr(zh>). Set X 2 equal to the next p s — 1 terms and continue in this way 
up through X m -\. Define t' so that p f is the smallest term in cr(zh>) which was 
not assigned. For any fixed t > t' set X m = yit) — (X\ + • • • + X m _i). It is now 
easy to see that X £ U m (y(t)). ■ 
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Let yit) be as in the lemma. It is sufficient to show that for any m there exists a 
t m such that if t > t m then v m (y) = v m (y(t)). For in that case X y is defined for 
all m > 1. Further, (2.3) implies 

Xy(m) - X y (m - 1) = Xy( t )(m) - X m (m - 1) 

for m > 2 and any t > max{t m _2, t m -i } t m }. Thus Case II is reduced to Case I. 

We show that for all m > there exists a t m such that if t > t m then v m (y) = 
v m {y{t)). Since v (y) = for all y, we may assume m > 1. By Lemma 2.1, there 
exists a positive integer t' such that /7 m+ i (y(t)) is nonempty for all t > t' . For 
t > t' denote the greedy element of U m+ i (y(t)) by G t = (G\, G 2 , . . . , Note 
that for any X £ £7 m +i (y(t)) we have 

u;t(X) = Xi + • • • + mX m + (m + l)X m+ i and 

= (m + l)y(<) - (m + l)^i (m + l)X m+1 . 

By adding we deduce 

urfpQ - y{t) = my(t) - (mX 1 + ■■■ + 2X m _ x + X m ). 

Thus for t >t' we have by Theorem 1.4(c) and (2.2), 

v m (y(t)) = my(t) - degS' m (y(t)) = ™G\ + • • • + 2Gt-i + G l m . (2.9) 

Thus v m (y(t)) depends only on the first m terms of G 1 . We claim that there 
exists t m such that if t > t m then G 1 and G tm differ only in their (ra + l) s * term. 
That is, we have G\ = G* m for 1 < i < ra, so that u m (y(t)) = v m (y(t m )). To see 
this, note first that G 1 = (G\, G 2 , . . . , G f J n+1 ) £ /7 m+ i (y(t)) implies 

(<?*,..., G* m , GL+i + e £Wi (y(* + i)) • (2.10) 

Secondly, note that by greediness, y(t + 1) — G^^ is minimal in the set 

{y(t + 1) - X m+1 : X £ U m+1 (y(t + 1))} . (2.11) 
Thus (2.10) and (2.11) imply 

w + 1) - g^Ii < m + 1) - (gi + i + = m - gi +1 . 
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In other words, yit) — G i m+1 monotonically decreases as t increases. Since yit) — 
G i m+1 is bounded below by zero, we can choose t m large enough so that 

y(t) ~ Gl+i = y(tm) - G% +1 for all t > t m . 

Finally, note that the greedy element of U m (y(t) — G t m+1 ^) is obtained by dropping 

the last term from the greedy element G 1 = (G\ } G f 2 , ■ ■ ■ , of U m+ \ (y(t))(Proposition 

4.1(a)). Thus we have if t > t m then G\ = G\ m for 1 < i < m. Now (2.9) implies 

we have v m (y(t)) = v m (y(t m )) for all t > t m . Since y = lim^oo yit) we have 

v m {y) = v m (lim^oo y(t)) = lim^oo v m (y(t)) = v m (y(t m )). ■ 

3. Notation and definitions 

In this section we set up the notation, present some definitions, and state some 
basic results. We begin with a discussion of the conditions a composition X = 
(Xi } . . . , X m ) of N must meet in order to be valid: 

(i) there is no carryover of p-adic digits in the sum N = ^ Xj] 

(ii) Xj is a multiple of p s — 1 for 1 < j < m — 1. 

To deal with condition (i) we treat a given positive integer N as the shortest 
nondecreasing sequence cr(N) of powers of p whose terms sum to N. For N > 
define deg p (iV) to be the exponent of the largest power of p appearing in &(N). 

Example. Let p = 3 and N = 131. In base 3 we have N = 1 1212 3 . Thus 
a(N) = (1, 1, 3, 3 2 , 3 2 , 3 3 , 3 4 ) and deg p (N) = 4. 

We view a sequence as an ordered multiset: a set in which an element may appear 
more than once. A partition of a multiset M is a collection of non-empty multisets 
whose disjoint union is M. Thus, a composition (X\, ■ ■ ■ } X m ) of N satisfies (i) if 
and only if {cr(Xi), • • • , cr(X m )} is a partition of &(N). 

To deal with condition (ii) we use the fact that a number is divisible by p s — 1 
if and only if the sum of its p s -adic digits is divisible by p s — 1. In order to keep 
track of p-adic digits when summing p s -adic digits we define the map r : N — >■ W 
as follows. Given N £ N with p-adic expansion N = ^j>o n J^' define T(iV) to 
be the column vector [u , u i} . . . , u s _i]* where it 4 - is the sum of all nj such that j = 
«(mod s) (and the superscript 't' indicates transpose). If T(iV) = [u , u i} . . . , u s _i]* 
then the sum of the p s -adic digits of N is u + U\p + • • • + u s _ip s ~ l . 
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Example. For p = 3 and s = 2, we have r(11212 3 ) = [5, 2]*. 

Set ip := . . . , p 5-1 ]* and let (*, *} be the standard inner product on IR S . Then 
a composition (X\, ■ ■ ■ } X m ) of N satisfies condition (ii) if and only if 

(p s - 1) | (V>o, r (Xj)) for 1 < j < m - 1. 

Notice that T is not in general an additive function. However, when (X\, . . . , X m ) 
satisfies condition (i) we have for any {ji, . . . , jk} C {1, 2, . . . , m}, 

T(X J1 ) + ... + T(X Jk ) = T(X J1 +... + X Jk ). 

Define the following partial order: for two vectors x = [x 0} . . . , x s _i]* and y = 
[yo, • • • , Vs-iY m we write x < y if and only if X{ < y 4 - for < i < s — 1. It is 
important to note that x < y means X{ < y 4 - for < z < s — 1 with < y 4 - for at 
least one i. 

Example. Set p = 3 a_nd 5 = 2. We construct all valid compositions of 11212 3 
with m = 2 components. In order to satisfy (i) we partition cr(11212 3 ) into two 
parts (Ei, E 2 ); then we set X{ equal to the sum of the elements of E 4 -. In order to 
satisfy (ii) we take Ei so that 8 divides T (Xi)^. Tie o_nIj vectors v G N 2 \{0} 
suci that v < [5, 2]* and 8 divides (t/S , v) = u + 3ui are [5, 1]* a_nd [2, 2]*. Now it 
is easy to verify that V 2 (11212 3 ) = {(11202 3 , 10 3 ), (11110 3 , 102 3 ), (11011 3 , 201 3 ), 
(10212 3 , IOOO3), (I2IO3, 10002 3 ), (IIII3, 10101), (1012 3 ,10200 3 )}. 

Given a composition X = (X\, . . . } X m ) define TX to be the s X m matrix 
with columns T(X\), . . . , Y(X m ). 

Example. Set p = 3 and s = 2. If X is (11202 3 ,10 3 ) or (10212 3 , 1000 3 ) then 

r 2 3 

. For any other X G 14(112123 ) above, we have TX = 2 ' 

In the proof of Theorem 1.2 we assume there exist m and N such that V m (N) 
contains an optimal composition O which is different from the greedy composition. 
Our goal is to arrive at a contradiction by constructing a composition Z G V m (N) 
such that wt(Z) > wt(0). To construct Z we first construct an s X m matrix 
B and then define Z to be optimal among those valid compositions X for which 
TX = B. We now discuss the construction of Z from B. 

For < i < s — 1 define Ti(N) to be the subsequence of cr(N) consisting of all 
p k G &(N) such that k = i (mods). 



TX 



5 
1 1 
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Example. Set p = 3, s = 2, and N = 11212 3 . Then <r(N) = (1, 1, 3, 3 2 , 3 2 , 3 3 , 3 4 ) ; 
r(iV) = [5, 2]* and we iave r (iV) = (1, 1, 3 2 , 3 2 , 3 4 ) and Tl (N) = (3, 3 3 ). 

Note that if T(N) = [u 0} u i} . . . , u s _i]* then U{ is the length of T 8 (iV). 

Consider two components Xi and Xj, with i < j, of a valid composition 
X = (Xi, . . . , X m ) of N . Suppose p k is a term in Th(Xi) and p l is a term of Th(Xj) 
(so that k = / = h (mod s)). It is easily seen that 

X' := (X u . ..,X l -p k +p l ,...,X J -p l + p\ ...,X m ) 

is also a valid composition of N. We say a valid composition X = (X\, . . . } X m ) 
is r-monotonic if and only if for all 1 < i < j < m and < h < s — 1, the 
largest term of t/^JQ) is no larger than the smallest term of Th(Xj). Equivalently, 
X = (X\, . . . , X m ) is r-monotonic if and only if for all < h < s — 1 the sequence 
Th(N) is simply the concatenation of the subsequences Th(X\), . . . , Th(X m ). 

Lemma 3.1. The greedy and all optimal elements ofV m (N) are t -monotonic. 

Proof. If k > I in the definition of X' above, then X' has a larger weight than 
X and (X m , . . . , Xj — p l + p k , . . . , Xi — p k + p\ . . . , X\) is lexicographically larger 
than (X m , . . . ,Xi). ■ 

Given an s X m matrix B, denote by V^(N) the set of all valid compositions 
X of N such that TX = B. 



Lemma 3.2. IfV^(N) is not empty then it contains a unique t -monotonic com- 
position. 

Proof. A composition X = (X±, . . . , X m ) £ V^(N) is uniquely determined by 
the sequences Th(Xj). If X is r -monotonic then for each fixed h the sequences 
Th(Xj), 1 < j < m, are uniquely determined by the sequence Th(N) and the h th 
row of the matrix B. ■ 



Example. Set p = 3 and s = 2. We construct the r-monotonic composition 



[Z U Z 2 ) in y 2 B (11212 3 ) where B 



2 3 
2 



. The sequences 



70(20:= (1,1), r (Z 2 ):=(3 2 ,3 2 ,3 4 ) 
71(20:= (3, 3 3 ), r 1 (Z 2 ):=0. 
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are forced by T-monotonicity. Thus we have Z\ := (1 + 1) + (3 + 3 3 ) = 1012 3 and 
Z 2 := 3 2 + 3 2 + 3 4 = 10200 3 . 

We now specify the conditions B must meet in order for V®(N) to be non- 
empty. To this end we use a matrix to characterize the set of column vectors 

3 := {r(&) : k is a positive multiple of p s — 1} . 

Let e , . . . ,e s _i denote the standard basis of column vectors for IR S . Define e 8 - : = 
pei_i— e 4 - for < i < s — 1. Here and for now on indices which should range 
from to s — 1 are evaluated modulo s : i.e., a 4 - = aj when i = i (mods). Thus 
£o = P^s-i— eo- Define the s X s matrix E := [e , £1, • • • , £ s -i]- For example if 
p = 5 and 5 = 3 then 

r -1 5 

E= 0-1 5 

5 0-1 

Below we show that 3 = (EZ S ) n (N s \{0}). We will often use the fact that for 
two arbitrary column vectors u = [u , . . . , u s _i]* and a = [a , . . . , a s _i]*, if u = Ea 
then 

Uj = pctj + i — a, j. 

To construct the inverse of E, define R := [ei,e 2 , . . . ,e s _i,e ] to be the permu- 
tation matrix which rotates the coordinates of a vector to the right: Re 8 - = e 8 +i. 
For 1 < i < s — 1 define ipi := RVo- Then 



~ { J otherwise ' ^ 



Thus E 1 = (p s — 1) 1 [?/> , ipi, ■ ■ ■ , V's-i]*- We list some simple facts about the 
vectors ipi. 

Lemma 3.3. (a) For k G N, k = (tp , F(k)) (modp s - 1). 

(b) For u G Z s , (^0, Ru) = p (jp ^ u) (modp s — 1). 

(c) For u G Z s , ^ ,u) = p l (ipi,n) (modp s — 1). 

(d) For x G R s , {ipi, Ex) = x-{p s - 1). 

Proof. Part (a) is clear. For (b) note that 

(V> ,Ru) = p \ y2 u ip) ~ u s -i(p s - 1) = p(V> ,u) - u s _i{p s - 1). 
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Part (c) follows from (b). Part (d) follows from (3.1). ■ 

The next lemma gives us a useful characterization of the set 3- 

Lemma 3.4. Let k be a positive integer. Then k is a multiple of p s — 1 if and 
only ifT(k) = Ea for some a £ Z s . In other words 3 = (EZ S ) n (N s \{0}). 

Proof. Suppose T(k) = Ea for some a £ Z s . Then by Lemma 3.3(d), r(&)^ = 
a (p s — 1). By Lemma 3.3(a), A; is a multiple of p s — 1. 

Suppose A; is a positive multiple of p s — 1. Set x := E _1 r(A;). It is sufficient to 
show that x £ Z s . From the expression for E -1 given above, 

s-1 

x = (f - l)" 1 [V> , V>s-i] * T(k) = Y,(p S ~ I)"' (h, e t - 

8 = 

By Lemma 3.3(a), p s — 1 divides r(k)^. Lemma 3.3(c) implies that each 
^i,u^ is divisible by p s — 1. Thus x £ Z s . ■ 

Now we state precisely when V^(N) is non-empty. 

Lemma 3.5. Let B = [bi,...,b m ] be an integer matrix. The set V^{N) is 
nonempty if and only if (a) the columns of B sum to T(iV) and (h) bi, . . . , b m _i 
are members of 3 and h m > 0. 

Proof. If X £ V^(iV) then B = FX certainly satisfies (a) and (b). Conversely, 
suppose that B = [6 8 J ] satisfies (a) and (b). We construct the r-monotonic element 
of V^(N). By condition ( a) one can form subsequences 8j i, 0;,2, • • • , 0;,m of 
Ti(N) such that 8J has length 6 8J and such that Ti(N) is the concatenation of 
the sequences 8j i, 0;,2, • • • , 0;,m for < z < s — 1. For 1 < j < m define 

s-1 

Xj to be the sum of the elements of |J Then TX = B. By construction 

8 = 

{cr(Xi) } ■ ■ ■ , cr(X m )} is a partition of cr(N). Condition (b) implies that Xj is a 
positive multiple of p s — 1 for 1 < j < m — 1 and X m > 0. Thus X £ V^(iV). ■ 
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4. Some preliminary results and a proof of the q = p case. 

This section lists several technical results and ends with a proof of Theorem 1.2 
for the case q = p. Our first result consists of several observations on how to 
obtain new optimal (or greedy) compositions from old ones. 

Proposition 4.1. Suppose X = (X\, . . . , X m ) is an optimal (or the greedy) com- 
position in V m (N). Then 

(a) (X\, . . . , X m -\) is an optimal (or the greedy) composition in V m -\ (N— X m ). 

(h) If p k G <j(X m ) and X m > p k then (X±, . . . , X m _i, X m — p k ) is an optimal 
(or the greedy) composition in V m (N — p k ). 

(c) For any integer n > 0, the composition (p n Xi } . . . } p n X m ) is an optimal (or 
the greedy) composition in V m (p n N). 

Furthermore the six statements remain true when the sets V^(*) are replaced 
with the sets £/*(*) throughout. 

Proof. The twelve statements are all easily proved with similar arguments. The 
four statements of Part (a) are easiest. We prove Parts (b) and (c) for the sets 

K(*). 

For part (b), note that the assumptions p k G cr(X m ) and X m > p k imply that 
X^ := (Xi, . . . ,X m — p k ) is a valid composition in V m (N — p k ). Suppose X^ is 
not optimal (or not greedy). Then there exists a Y = (Y]_, . . . , Y m ) G V m (N — p k ) 
such that wt(Y) > wt(X^) (or such that (Y m , . . . , Y\) is lexicographically larger 
than (X m -p k ,...,X 1 )). Clearly (Y u Y m +p k ) G V m (N). Note that the weight 
of (Y]_, . . . , Y m + p k ) is strictly greater than wt(X) (or that (Y m + p k , . . . , Y\) is 
lexicographically larger than (X m} . . . ,X\)). This is a contradiction. 

For part (c), first note that for all i the i th term of a(p n N) is just p n times the 
i th term of cr(N). Thus {a(p n Xi), . . . ,cr(p n X m )} is a partition of a(p n N). Since 
p s — 1 | p n X 1 for j < m — 1 we have 

:= (p n X u . . . ,p n X m ) G V m (p n N). 

Suppose X^ is not optimal (or not greedy). Then there exists a Z = (Zi, . . . , Z m ) G 
V m (p n N) such that wt(Z) > wt(X^) (or such that (Z m , . . . , Z\) is lexicograph- 
ically larger than (p n X m} . . . } p n Xi)). Since each term in cr(p n N) is divisible by 
p n } we have {cr(p~ n Z\), . . . , cr(p~ n Z m )} is a partition of &(N). Clearly p~ n Z rj is 
divisible by p s — 1 for 1 < j < m — 1. Thus 

(p- n Z 1 ,...,p- n Z m ) G V m (N). 
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This is a contradiction since the weight of (p n Z i} . . . } p n Z m ) is strictly greater 
than wt(X) (or (p~ n Z m} . . . , p~ n Z\) is lexicographically larger than (X m} . . . , ^1)). H 

The remaining results of this section depend on certain subsets of N s whose 
definitions we now motivate. Let X = (X 1 ,...,X m ) G V*(N) where B = 
[bi,b 2 , . . . , b m ] . Suppose that for some j < m — 1 we have bj = iii + u 2 with 
Ui,u 2 G 3- Then Lemma 3.5 (applied to the matrix [ui,u 2 ]) implies there exists 
a valid composition (Yi 7 Y 2 ) of X r Clearly 

X' := (Xi, . . . , Yi,Xj +1 , . . . , X m + F 2 ) 

is a valid composition of N. Further, it is clear that wt(X') > wt(X) and also 
that X' is, in our reverse lexicographic order, larger than X. Thus X can be 
neither optimal nor greedy. In other words, if X = (X\, . . . , X m ) is either optimal 
or greedy then for 1 < j < m — 1 we have V^Xj) = 0. To take advantage of 
arguments such as this we define following the subsets of N s : 

I m : = {T(k) : k G N and V m (k) + 0} ; 

J'm • — ii l~l ( Im\Im-\-l ) ■ 

We argued above that if (X\, . . . , X m ) is either optimal or greedy then T(Xj) G J\ 
for 1 < j < m — 1. 

We have two alternate ways of expressing 7 m : 

I m = {u G N s : 3vi, . . . , v m _i G 3 such that u > Vi + • • • + v m _i} (4.1) 
= {u G N s : 3?7i, . . . , ?/ m _i G 3 such that u > fj m -i > • • • > fji} ■ (4.2) 

Expression (4.1) follows from Lemma 3.5 by taking 

B = [vi, • • • , v m _i,u — (vi H h v m _i)] . 

Expression (4.2) is gotten from (4.1) by taking fjj = V\ + • • • + Vj. It is clear that 

Im ^ h whenever k < m. 

From (4.1) we see that J m is the set of all u G 3 such that u decomposes into a 
sum of m elements o/3 but not into a sum of m + 1 elements o/3- 

Proposition 4.3 below is the seminal result concerning the subsets I m and J m . 
Before we prove it we must list some basic facts about the matrices E and R. Recall 
that E = [£ ,£i, • • • , £ s -i] where e 8 - = p e 4 -_i— e 4 - and that R = [ei,e 2 , . . . ,e s _i,e ] 
is the matrix which rotates the coordinates of a vector to the right. 
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Lemma 4.2. (a) {E _1 x : x > 0} C (R + ) s . 
(h) Ifx > y then E _1 x > E _1 y. 

(c) RE = ER. 

(d) R5 = 3. 

(e) For anykeN we have R k T(N) = T(p k N). 

Proof. Part (a) follows from the fact that all the terms in E -1 are positive. Part 
(b) is equivalent to Part (a). For Part (c) note that Re 8 - = Ei + \ for all i. Thus we 
have 

RE = [ei, e 2 , • • • , £ s -i, £o] = * • Ee 2 , . . . , Ee s _i, Ee ] = ER. 

Now (d) follows from Lemma 3.4 and the fact that R and E commute. For Part (e) 
let N = n + n-ip + • • • + nip 1 be the p -adic expansion of N. Set [u , . . . , u s _i]* : = 
T(N) and [v , . . . , u s _i]* := T(p k N). Since p k N = n p k + n lP 1+k + ■■■ + n lP l+k is 
the p-adic expansion of p k N we have Vi + k = it 4 -. ■ 

Proposition 4.3. For m > 1 we have, 

(a) I m = {Ex £ N s : m - 1 < min{x , . . . , x s _i}} ; and 

(b) J m = {Ea £ f : m = min{a , . . . , a s _i}} 

wiere x = [x , . . . , ^s-i]' &nd a = [a , . . . , cis-iY are taken from IR S . 

Proof. First we show that (a) implies (b). Assume (a) and let a £ IR S . Suppose 
first that Ea £ J m . By definition Ea £ 3 H (I m \I m+ i). Since Ea £ (I m \I m +i), 
Part (a) implies 

m — 1 < min{a , . . . , a s -i} < 
Since Ea £ 3 we have a £ Z s . Thus m = min{a , ai, . . . , a s _i} and therefore 

^ {Ea £ N s : m = min{a , , • • • , a s _i}} . 

For the other inclusion suppose Ea £ N s and m = min{a , . . . , a s _i}. Then (a) 
implies Ea £ (I m \I m+ i). To show Ea £ 3 we show by reverse induction that 
a £ Z s . Suppose = m and suppose further that a 8+ i is an integer for some 
i + 1 < h. Set [w 0} . . . , Ws-iY := Ea. Then 

Wi = pa i+1 - a t . 

Since Wi and a 8+ i are integers, so is a 4 -. Thus a £ Z s . Therefore Ea G 3 fl 
(/ m \/ m +i) = J m and (a) implies (b). 
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We prove Part (a). Suppose u £ I m . Set x := E _1 u. By (4.1) there exist 
vectors vi, . . . , v m in 3 such that u > vi + • • • + v m _i. For 1 < j < m — 1, set 
hj := E _1 Vj. Lemma 4.2(b) implies x > (bi + • • • + b m _i). By Lemma 4.2(a) 
and the definition of 3, each hj := [&oj, • • • , &s-i ,j] is m (N\{0}) s . Thus, for 
< i < 5 — 1 we have > + • • • + &;, m -i > m — 1. 

Now suppose that u = Ex £ N s with m — 1 < min{x , . . . , x s _i}. Let A; be 
such that Xk = min{x , . . . , x s _i}. Let [xj denote the largest integer < x. Then 
\_Xk\ > m — 1. By (4.2) our proof will be complete once we show that there exist 
vectors rji, . . . } fj[ Xk \ £ 3 such that u > fj[ Xk \ > • • • > fji- We lose no generality 
by assuming k = 0: if k ^ 0, we construct fji, . . . ,rj[ Xk \ using R s_fc u in place of 
u. Then u > H k fj[ Xk j > ■ ■ ■ > R k f]i and the R k f]i are in 3 by Lemma 4.2(d). 
Note that u satisfies the conditions on v stated in Lemma 4.4 below. The proof 
is completed by recursively applying Lemma 4.4 to get fj[ Xk \ , ^aJ-I' • • • > Vi- 

Lemma 4.4. Suppose that v = Ec £ N s with 1 < c = min{c , . . . , c s _i}. Then 
there exists a vector w = Ed £ 3 with w < v and such that c — 1 < d = 
min{c/o, • • • , 

Proof. We define the terms of d = [do, ■ ■ ■ ,0^-1]* inductively as follows where 
[x~\ is the smallest integer > x. 

Set d := I" Co] — 1. 

Set di := min{[c 8 ] — l } pdi +i } for i = s — 1, s — 2, . . . , 1. 

We claim that 

d = mm{d 0} . . . , d s -i }• (4.3) 

Since c < c s _i we have o? s -i = niin{[c s _i] — l } pd } > o?o- Proceeding by re- 
verse induction, suppose 4+i ^ 4 for some i + 1 < 5 — 1. Then we have 
4 = niin{[c 8 ] — l } pdi + i} > d . Thus (4.3) is justified and therefore c — 1 < 
d = mm{d 0} . . . , 4-i }• 
Note that now we have 

4 = niin{[c 8 ] — l,p4 + i} for a// (4-4) 

With w := [w 0} . . . , Ws-iY := Ed we have 

Wi = pd i+1 - 4 

= max{p4 + i - \ci[ + 1,0}. 
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This implies w £ X 

We show that w < v. Fix i and assume W{ > 0. Then W{ = pdi +i — [c 8 ] + 1. 
Since Vi = pci +i — C{ we have 

Vi - Wi = p(c i+1 - d i+1 ) + \ci] — q — 1. 

By (4.4) we have c 8+ i > c/;+i. Since [c 8 ] > c 4 - we have V{ — W{ > —1. Since V{ — W{ 
is an integer we must have W{ < V{. Further, w ^ v since d < c . Thus w < v as 
desired. ■ ■ 

Define 

P ■= \P- 1, !]*• 

Our next result is a simple lemma used by the proof of Proposition 4.6 below. 

Lemma 4.5. Let u he an element of J m with m > 1. Suppose v G N s with v < p 
and v < u. Then u — v is an element of I m U J m -i (where J := 0). 

Proof. Since I\ = N s \{0}, we may assume m > 1. Set a := [a , . . . , Os-i]* : = 
E _1 u and x := [x , . . . , ^s-i]* := E _1 v. Note that E -1 p = [1 , . . . , 1]* . Lemma 
4.2(b) implies X{ < 1 for all i. Thus by Proposition 4.3(b), we have m — 1 < 
min{a — x 0} . . . , a s _i — x s _i}. By Proposition 4.3 again, we have u — v £ I m \JJ m -\. 
■ 

Proposition 4.6. Fix iV a_nd m such that p s — 1 divides N and V m (N) is not 
empty. If X is either an optimal or the greedy composition in V m (N) then 

(m - l)N < wt(X) < mN. 

Proof. Note that wt(X) < mN is true for any X in V m (N). Our proof of the 
other inequality is by induction on m. The result is trivially true when m = 1. 
Suppose it is true for m < m . Fix m = m > 1. Let 

N = n + nip + • • • + n k p k 

be the p-adic expansion of A^. Since N is divisible by p s — 1 we have u := T(N) £ Jh 
for some h > m. Set v := L(n/;p fc ). As 1 < n k < p — 1 we have v < p. Also 
v < u since m > 1 rules out the possibility that A^ = n^p k . By Lemma 4.5, 
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u — v is an element of 1^ U Jh-i- Since h > m and u — v = Y(N — n k p k ) we have 
T(N-n k p k ) G / ro U J ro _i. 

Suppose that F(N — n k p k ) G I m . Then V m (N — n k p k ) is not empty. Let 
(Y]_, . . . , Y m ) be any composition in V m -\(N — n k p k ) and set 

Y' :=(Y 1 ,...,Y m + n k p k ). 

If T(iV — n k p k ) G J m -i choose from T4i_i(./V — n k p k ) and set 

y' := (F!,...,^-!,^/). 

In either case Y 1 G V m (N) and > n^p^. 

Define G := (Gi, . . . , G m ) to be the greedy composition in V m (N). Greediness 
implies G m > Y' m . Thus 

G m > n kP k > N/2. (4.5) 

Set Z := (Cri, . . . , G m -\). Then Z is the greedy composition in V m -\(N — G m ) 
(Proposition 4.1(a)). Further, we have 

wt(G) = wt(Z) + mG m . (4.6) 

Our induction hypothesis implies 

(m-2)(N-G m ) < wt(Z). (4.7) 

Combining (4.6) and (4.7) we get 

wt(G) > (m - 2)(N - G m ) + mG m = (m - 2)N + 2G m . 

Now (4.5) implies (m — 1)N < wt(G). If O is an optimal composition in V m (N) 
then wt(0) must satisfy the same inequality. ■ 



Proposition 4.7. If X = (X\, . . . } X m ) is either an optimal or the greedy com- 
position in V m (N) then T(N — X m ) G J m -i- 

Proof. We show the contrapositive: if T(N — X m ) (/ J m -i then X is neither 
greedy nor optimal. 

Since p s — 1 divides N — X m we have T(N — X m ) G 3 H I m -i- Assume 
T(N — X m ) (/ J m -i- Then by the definitions we have T(N — X m ) G I m and thus 



21 



V m (N — X m ) ^ 0. Let Y := (Yi, . . . , Y m ) be any composition in V m (N — X m ). 
Then Y' := (Y\, . . . , Y m + X m ) is a valid composition in V m (N). Thus X cannot 
be greedy since (Y m + X m , . . . , Y\) is lexicographically larger than (X m} . . . , X\). 

Set X' := (X u ... } X m _ 1 ) G V m - X (N -X m ). Then wt(X) = wt(X') + mX m . 
Let G := (Cri, . . . , Cr m ) be the greedy composition in V^iV — X m ). Set 

Z := (Cri, . . . , G 

m — 1 , G m -\- X m J . 

Then Z G K„(iV) and wt(Z) = wt(G) + mX m . Since p s — 1 divides N — X m} 
Proposition 4.6 implies 

wt(X') < (m - l)(iV - X m ) < wt(G). 

Adding mX m across these inequalities gives wt(X) < wt(Z). Thus X cannot be 
optimal. ■ 

Our next result concerns the following subsets of J m : 

J % m := {Ea G N s : a; = m = min{a , . . . , a s _i}} . 

Proposition 4.8. If X = (X\, . . . } X m ) is either an optimal or the greedy com- 
position in V m (N) then there exists an i such that T(Xj) G J[ for 1 < j < m — 1. 

Proof. Note that 

r(Xi) + --- + r(x m _i) = r(Xi + --- + x m _i) 

= T(N-X m ). 

We argued above that each T(Xj) must be in J\. Since T(N — X m ) G J m -i by 
Proposition 4.7, the result is implied by Proposition 4.3. ■ 

Proposition 4.9. Suppose X = (X\, . . . , X m ) is either greedy or optimal in 
V m (N) and suppose p k is any element of a(X m ). Then V m (N — p k ) is empty 
if and only if X m = p k . 

Proof. If X m > p k then (Xi, . . . , X m — p k ) would be an element of V m (N — p k ). 
Thus, if V m (N — p k ) is empty then X m = p k . Suppose that X m = p k . Then 
F(N — p k ) G Jm-i by Proposition 4.7. Therefore F(N — p k ) ^ I m which is another 
way of saying V m (N — p k ) is empty. ■ 

Now we prove Theorem 1.2 for the case q = p. Recall that ^ is the set of all 
pairs (m, N) such that V m (N) contains an optimal element that is not the greedy 
element. 
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Proposition 4.10. If s = 1 then ^ is empty. 

Proof. Fix m and N so that V m (N) is not empty. Let G be the greedy and O 
any optimal element of V m (N). Note that since 5 = 1 we have J\ = {p — 1}. Thus 
TG = TO = B where B is the 1 X m matrix 

[ p-l,. . .,p-\ , T(N) - (m - l)(p - 1)]. 

m — 1 

(In this case T(N) = £(N) = the sum of the p-digits of N.) By Lemmas 3.1 and 
3.2 we have G = O = the r-monotonic element of V^(N). ■ 

5. If ^ ^ then there exists an optimal O . . . 

Sections 5, 6, and 7 constitute the proof of Theorem 1.2 for the case s > 2. In 
these sections we assume that ^ is not empty in order to derive a contradiction. 
The purpose of the present section is to prove the following proposition. 

Proposition 5.1. Suppose s > 2. If ^ is not empty then there exists an m > 2, 
an N, and an optimal element (0\ , . . . , O m ) £ V m ( N) which satisfies the properties 
listed below. Here (G\, . . . , G m ) is the greedy element of V m (N). 

(a) T(Oj) £ Ji for 1 < j < m - 1. 

(b) deg p (O m ) < deg p (G m ). 

(c) O m = p k for some k £ N. 

(d) (V> , r(G m )) = (V>o,r(O m )> yet Y(G m ) ± Y(O m ). 

Proof. Choose (m, K) £ ^ so that (m,£(K)) is lexicographically minimal. Let 
O' = (0[, . . . , 0' m ) be any optimal element of V m (K) that differs from the greedy 
element G' = (G' l7 . . . } G' m ). By Proposition 4.8, there exists an h such that 
r(O^) £ J* for 1 < j < m - 1. Define AT := p 5 -' 1 /^ and define 

0:=(0 1 ,...,O m ):=(p s - h O' 1 ,...,p s - h O' m ). 

Lemma 4.2(e) implies Part (a): 

T(Oj) = Tip'^Oj) £ J° for 1 < j < m - 1. 
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By Proposition 4.1(c), O is an optimal and 

G := {G u ...,G m ) = (p s ~ h G[, . . . ,p s ~ h G' m ) 

is the greedy element of V m (N). Clearly ^ G. Thus (m, iV) £ Further, 
since £(N) = £(K), we have that (m,£(iV)) is lexicographically minimal. Thus we 
have the following lemma. 

Lemma 5.2. If m! < m or if m! = m and £(N') < £(N) then either V m i(N') is 
empty or its greedy element is its unique optimal element. 

It is obvious that m > 1. We argue that m > 2. If G = (Gi,^) is the greedy 
element of ^(./V) then G2 > X 2 where X = (Xi 7 X 2 ) is any other element of 
V 2 (N). But then 

wt(G) - N = G 2 > X 2 = wt(X) - N. 

Thus the greedy element of V 2 (N) is the unique optimal element. This implies 
m > 2. 
Set 

o = [o , . . . ,o s _i]* = T(0 m ), 

g = [g , . . . jflfs-i]* = r(G m ), and 

u = [u , . . . ,u s _if = T(N). 

Define integers (3 and 7 by 

(3 = deg p (O m )(mod s), and 
7 = deg p (G m )(mods) 

where 0</3,7<s — 1. 

We show deg p (G m ) > deg p (O m ). By greediness, we have deg p (G m ) > deg p (O m ). 
Suppose that deg p (G m ) = deg p (O m ). Then gp and op would both be positive. But 
this contradicts following lemma. 

Lemma 5.3. For each h with < h < s — 1, either g^ = or = 0. 

Proof. Suppose there exists an h such that > and Oh > 0. By Lemma 3.1 
both O and G are r-monotonic. Thus cr(G m ) and cr(O m ) both contain the largest 
element, say p n } of Th(N). 
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Suppose first that V m (N — p n ) is empty. Then O m = G m = p n by Proposi- 
tion 4.9. By Proposition 4.1(a), (0\ } . . . , O m -\) is optimal and (G\, . . . , G m -\) 
is greedy in V m -\(N — p n ). But then Lemma 5.2 implies (0\ } . . . , m -\) = 
(Gi, . . . , G m -\) and so = G. This contradicts our choice of 0. 

Now suppose V m (N — p n ) is not empty. Then m and G m are both > p n 
by Proposition 4.9. Proposition 4.1(b) implies (0\ } . . . , m — p n ) is optimal and 
( G\ , . . . , G m — p n ) is greedy in V m (N — p n ). Again Lemma 5 . 2 implies the contra- 
diction that = G. ■ 

Thus we have Part (b). 

For Parts (c) and (d) we need Lemma 5.5. Lemma 5.5 needs Lemma 5.4 below. 

Lemma 5.4. For any x such that o x > we have 

u e^/ e^ (jz J m —\ U I m . 

Proof. Since o x > 0, Lemma 5.3 implies x ^ 7. Suppose on the contrary that 
u — e 7 — e^ G Jm-i U I m . Then there exist vectors vi, . . . , v m -i in 3 such that 

vi + . . . + v m _i < u - e 7 - e x . (5.1) 

Let p c be the largest element in t x (N). Then we have T(N — p c ) = u — e^. Since 
Vi + . . . + v m _i < u — e^ we have < v m := (u — e^) — (vi + . . . + v m _i). Proposition 
3.5, applied to the matrix B := [vi, . . . , v m ], implies that V^(N — p c ) is not empty. 
Let X be the r -monotonic element of V®(N — p c ). Note that (5.1) implies that 
the 7-th coordinate of v m is positive. Thus we have deg p (X m ) = deg p (G m ). 

Since o x > and O is r-monotonic (Lemma 3.1) we have p c G cr(O m ). Thus 
O m > p c . Proposition 4.1(b) implies that Y := (0\ } . . . , m _i, O m —p c ) is optimal 
in V m (N — p c ). But then Lemma 5.2 implies that Y is also greedy in V m (N — p c ). 
This is a contradiction since 

deg p (F m ) < deg p (O m ) < deg p (G m ) = deg p (X m ). ■ 

The Parts (c) and (d) will follow from the next lemma. Recall that ip = 
[1, p, . . . , j/ -1 ]* and ipi = JVtpo for < i < s — 1 where the matrix R rotates 
a vector's coordinates to the right. 



25 



Lemma 5.5. Let x he such that o x > and set 

v := [v , u s _i]' := E _1 (u - - e 7 ). 
Then there exists an h such that Vh < m — 1. Further, for any such h we have 

(ii) (iph,o) < (4>h,e x ) + (^,e 7 ), and 
(in) (^,e 7 ) < (tp h ,e x ). 

Proof. By Lemma 5.4 we have u — e^ — e 7 (ji J m -i U I m . Thus by Proposition 4.3 
we have min{u , • • • , ^s-i} < m — 1. Fix h such that Vh < m — 1. 

First we prove Part (i). Set a := E _1 (u— o) and b := E _1 (u— g). By Proposition 
4.7 both u— o and u— g are elements of J m -i ■ Proposition 4.3(b) implies > m — 1 
and bh > m — 1. We show that 

a h = m - 1 = b h . 

First we show = m — 1. Note that since o x > we have u — o — e 7 < u — e^ — e 7 . 
Thus 

(?/>/,, u - o - e 7 ) < (?/>/,, u - e^ - e 7 ) . (5.2) 

From Lemma 3.3(d) we have (?/>/,,, u — o) = ah(p s — 1) and (?/>/,,, u — e^ — e 7 ) = 
v h(p s — 1)- Evaluating (5.2) gives 

a h (p s - I) - p 1 ' <v h (f -I) 

where p 1 = ^ iph, e 7 ). Thus we deduce 

pi 

ah~ v h < 



p s - 1 

Since Vh < m — I < a,h and p 1 < p s — 1, we have < ah — Vh < 1. Since a/,, is an 
integer we must have ah = m — 1 (and m — 2 < Vh < m — 1). The argument that 
6/j = m — 1 is completely analogous. Since g 7 > we have u — g — e^ < u — e^ — e 7 . 
Thus 

(tph, u-g-e x )< (tp h , u - e^ - e 7 ) . 
Set p x = (ipi x} e x ^ . As above we get 

x' 

bh - Vh < 



p s — 1 
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Thus, just as a/, = m - 1, we have that bh = m — 1. Lemma 3.3(d) implies 

(V>/>, u - g> = (m - 1)(/ - 1) = u - o) . (5.3) 

Thus (^/i,g) = (V>/i, o) as desired. 

Now for Part (ii). By (5.3) and Lemma 3.3(d) we have 

(tp h , u - o) = (ra - l)(p s - 1) 
> v h (p s - 1) 
= (?/>/,, u - e^ - e 7 ) . 

From this we deduce 

(?/>/>, o) < (tp h ,e x ) + (?/>/>, e 7 ) • (5.4) 

Part (iii) is more difficult. Note that since 7/1 we have ^ ?/>/,,, e~\ 7^ {"tph^x)- 
To establish ^^,e 7 ) < (?/>/,,, e^) we show that if ^^,e 7 ) > (?/>/,,, e^) then (9 cannot 
be optimal. 

Assume ^ ?/>/,,, e~\ > e^}. Recall that 7 = deg p (G m ). Thus e 7 < g and 
therefore ^?/S^,e 7 ) < (?/>/,,, g). Part (i) now implies 

(^,e 7 ) < (^,0) . (5.5) 

Since i^phi^x) < (^567) we must have 

(^phi^i) < (^567) f° r au * such that o 4 - > 0. (5-6) 

Statement (5.6) follows because if o 4 - > and i ^ x then we have ^ ?/>/,,, e 8 -) < 
(iph,e x ) + (?/>/>, e 7 ) by (5.4). But since (^ h} e t ) } (^ h} e x ) } and (?/>/>, e 7 ) are three 
distinct powers of p and {^>hi^x\ < (^/i,^), we must have (i^h^^i) < (V'/i?^). 
Rewrite (5.5) by replacing (?/>/,,, o) with its expansion as a sum of powers of p : 



{i>h,e~,) < ^(^,e,-). (5.7) 

i, o 8 >0 j=l 

By (5.6) each ^ ?/>/,,, e^ appearing on the right hand side of (5.7) divides /?/>/,,, e 7 \. 
Thus there must be some subset of the ^ ?/>/,,, e^'s which sum to precisely (?/>/,,, e~\. 
In other words, there must exist a vector w £ N s such that w < o and < ?/>/,,, w) = 
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(iph, e ~/)- I n yet other words, there must be a positive integer W such that a(W) is 
a subsequence of <r((9 m ), and T(W r ) = w, and 

(^r(w)) = (i> h ,r( P de *»( G ^)) . (5.8) 

Using Lemma 3.3 it is easily argued that (5.8) implies W = p de §p( Gm ) (modp s — 1). 
We know that p de s P ( G m) \ s a term in cr(Oj) for some j < m. Define 

X := {0 U . . . , Oj - p de ^ G ^ + W,...,O m -W + p de ^ G ^). 

Since congruences (modp s — 1) are preserved we have X £ V m (N). Since aiW) is 
a subsequence of a(0 m ) we have W < p de §p( Gm ) since, as was shown above, 
deg p (O m ) < deg p (G m ). Thus wt(X) > wt(0) which is a contradiction. Therefore 
our supposition that (?/>/,,, e^ < (V'/i?^) must be false. ■ 

We use Lemma 5.5 to show Part (c) of the proposition: that o is the standard 
unit vector ep where (3 = deg p ((9 m )(mod s). By Lemma 5.5 (ii) and (iii) there 
exists hp such that 

(V^o) < ($ hfj ,ep) + (V'/ l/3 ,e 7 ) and (5.9) 

(i>h p ,e~,) < (i>h p ,ef)) • (5.10) 
Note that (5.9) and (5.10) imply that , < 2 ^ tph p , ep) ■ Thus 

op = l. (5.11) 

If s = 2 then = 6,3 now follows from Lemma 5.3 since 7 ^ (3. 

Suppose s > 3. Before continuing we make a simple observation about the 
vectors Fix integers a, 6, c such that 0<a<b<c<s — 1. Let < i < s — 1 
and set p a := ^ 8 ,e a ^), := (tpi^e^ and p c := ^ 4 -,e c ^. Then it is easily seen 
that: 

if i < a or c < i then p a < p b < p c ; 

if a < i <b then p 6 ' < p c ' < p a '; (5.12) 

if b < i < c then p c < p a < p b . 

We continue with our proof that o = ep when s > 3. Suppose that in addition 
to (3 there exists another index a ^ (3 such that o a > 0. Applying Lemma 5.5 
again we have that there exists h a such that 

(i , h a ,o) < (i> ha ,e a ) + (V'/ la ,e 7 ) and (5.13) 
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Since o a > and op > with a ^ f3, (5.9) implies that we must have ("iph ,e a } < 
(tph e 7 ). Similarly, (5.13) implies we must have (V>/i a , e^) < (^/i a ,e 7 ). Now it 
follows that we have 

($ hfj ,e a ) < (V'/ l/3 ,e 7 ) < ($ hfj ,ep) and (5-14) 

{i>h a ,ef}) < {i>h a ,e~,) < (k,e a ) • (5.15) 

This is a contradiction since (5.12) implies that we cannot have both (5.14) and 
(5.15) simultaneously true. Thus no a exists. This and (5.11) implies o = ep. 
This completes the proof of Part (c) of the proposition. 

Now for Part (d). By Lemma 5.3 we have T(G m ) ^ T(0 m ). To show 
(?/>o, r(Cr m )) = (^o,r(0 m )) we use Lemma 5.5(i) with x = j3. It is sufficient 
to show that v < m — 1. Note that by Part (c) we have u — ep = T(N) — T(0 m ) = 

r(o 1 ) + --- + r(o m _ 1 ). Thus 

(m-l \ 

By Part (a), T(Oj) £ J?. Thus v = (m - 1) - f / \f - 1). ■ 

6. Constructing Z G V^iV). 

For this section we assume s > 2 and that is not empty. We use the notation 
of Section 5: = (0\ } . . . , m ) £ K„(iV) where (9, m, and N have been chosen 
so that has the properties listed in Proposition 5.1; (3 = deg p (O m ) (mods), < 
(3 < s- 1; G = (Gi, . . . , G m ) is the greedy element of V m (N); g = [g , . . . , ^-i]* = 
T(G m ); u = [u , . . . , u s _i]* = T(N). 

Our ultimate goal is to arrive at a contradiction by showing that for some 
Z £ V m (N) we have wt(Z) > wt(0). In this section we construct Z . We construct 
a new matrix B from the matrix TO and the vector g. Then we let Z be the r- 
monotonic element of V^(N). Because the construction of B is complicated we 
outline it here. As a first step we form the matrix [9ij] := [#i,...,# m ] whose 
columns are the partial sums of the columns of TO : 

3 

3 := ^r(Oi), for 1 < 3 < m - 

8 = 1 
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Second we take E 1 of the result: 

[w hJ ] := [wi, . . . , w m ] := E" 1 [9 hJ ] . 

Next, using Lemma 6.1 below, we perturb the rows of [wij] to obtain the matrix 
[dij]. Finally, we reverse the first two steps: set [Sij] = [$i,...,$ m ] = E[o? 8J ]; 
then set bi = Si, b 2 = S 2 — Si, • • • , b m = S m — S m -i to obtain B = [bi, . . . , b m ] . 

To obtain [o? 8J ] we only perturb those rows of [wij] with indices between a and 
(3 where a is given by Lemma 6.1 below. 

Lemma 6.1. There exists an index a, < a < (3 such that 

(a) if gi > then a < i < (3. 

(b) Wi jfn _i > m — 1 for a < i < (3, and 

(c) = m - 1. 

Proof. Set v := u — g and c := E _1 v. By Proposition 4.7 we have v G J m -i- 
Thus the terms of c are all > m — 1 (Proposition 4.3). Set a := E _1 (g — ep). By 
Proposition 5.1 we have T(0 m> ) = ep and so u — ep = 6 m -i- Thus 

c + a = E-^u - e^) = E _1 (# m _i) = w m _L (6.1) 

We show that a is in N s . Note that (6.1) implies a G Z s since c and w m _i are both 
in N s . By Proposition 5.1(d) we have (tp ,g — e/?) = 0. Thus a = (Lemma 
3.3(d)). Proceeding by induction, suppose a,h-i > 0. Note that since g := Ea + e^ 
we have 

_ f pah - dh-i if h - 1 ^ (3 , . 

pa h -a h _i + l ifh-l = (3 ■ [b - Z) 

Since gh-i > and p > 2 we have ah > 0. Thus a G N s . 

By Proposition 5.1(d) we have ^ ,g) = (t^o,^) = pP yet g ^ ep. Thus 
we have gi = for (3 < i < s — 1. Suppose (3 < s — 1. Since a = and 
= g s _i = a p — a s _i we have a s _i = 0. Now an easy induction argument implies 
that a,i = for (3 < i < s — 1. Thus if /3 < 5 — 1 then ap + i = 0. Also, if (3 = s — 1 
then a^+i = a = as noted above. With h — 1 = (3, (6.2) implies ap = 1. Let a' 
be smallest such that a„/ + i > 0. Then a' < (3. Proceeding by induction, suppose 
cih-i > where a' + l < h — 1 < (3. Since gu-i > 0, (6.2) implies > 0. We 
conclude that 

a G N s and a l > if and only if a' < i < (3. (6.3) 
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Now (6.1) and (6.3) imply 

Wi jfn _i > m — 1 for a < i < (3. (6-4) 

Further, (6.3) and (6.2) imply 

if g l > then a' <i< (3. (6.5) 

Note that Proposition 5.1(a) implies 6 m -i £ Jm-i- Thus wo, m -i = m — 1. Define 
a to be the maximum integer j < a' such that w hm _i = m — 1. Thus we have 
Part (c). Parts (a) and (b) follow from (6.5) and (6.4). ■ 

We now use [wij] to define [o? 8J ] = [di, . . . , d m ] . First define d m := E _1 r(iV). 
We recursively define the remaining elements of [o? 8J ] . Define 

f u^i,m-i if0<z<aor/3<z<s — 1 

8 ' m_1 ' \ min{^ jm _i - l,po? 8+ i jm _i} for z = /?,/? - 1, . . . ,a + 1 

and for j = m — 2, m — 3, . . . , 1, define 

^ f u;j-j if0<z<aor/3<z<s — 1 

M ' \ mm{d hJ+1 - l,pd i+ltj } for i = (3, (3 - 1, . . . , a + 1 

Note that for j ' < m — 1, since the vectors #j are members of 3, we have that the 
entries of the vectors Wj are integers. Thus the definition implies that the dj are 
integer vectors. Now we define the integer matrix 

[8 hJ ] := pi,...,^ m ] := E[d hJ ] 

Many of the arguments of Section 7 depend on the following result which will not 
only allow us to define Z but will also describe its structure in comparison to 0. 

Lemma 6.2. Let [u i} . . . , u m f := T(N). Then 

for 0<h<aor(3<h<s — 1, 
for a < h < (3, 

for 0<h<aor(3<h<s — 1, 
for a < h < (3. 



(a) &h;m-\ = U h 

(b) < ^, m _i < ma,x{u h - (p - 1), 0} 

(c) < £„, m -i < u a - p. 
For 1 < j < m — 2 we have 

(d) 8 K3 = e K3 

(e) < 8 htj < max{4, J+ i - (p - 1), 0} 
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Proof. We start with Part (a). For 0<h<aoi(3<h<s — 1 we have 

$h,m-l = pdh+l,m-l ~ dh,m-l 
= Qh,m-1- 

Since 6 m -i = u — e^ we have 8h, m -i = u h for these indices. Note that we also have 

$P,m-l = Pdf3+l } m-l — dp }Tn -i 

= max{p«J W i im _i - w/?, m _i + 1,0} 
= max{^ im _! + 1, 0} 

= Qf3,m-1 + 1 
= Up. 

This completes Part (a). 

For Part (b) fix h, a < h < (3. Then 

$h,m-l = pdh+l,m-l — dh } m-l 

= max{^4+i,m-i - Wh,m-i + 1,0}. 
Since d h+ i, m _i < w h+ i, m _i - 1 we have 

< ^, m -i < max{|)«)i + i im .i - w Km _ x - (p - 1), 0} 

= max{^ im _! - (p - 1),0} 

= max{uh — (p — 1), 0}. 

So we have Part (b). 

For Part (c) we must show < $a, m -i < u a —p. We have (^m-i = pd a+ i jfn _i — 
d a ,m-i- Since d a ^ m _i = w a ^ m _i and o? a _|_i jm _i < t/; a _|_i jm _i — 1, 

3a,m — 1 pda-\-l,m — l — 1 

(6.6) 

< p(^„ + i jm _i - 1) - ^«,m-l 

= Ui - p. 
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We still must show < $„ jm _i. Since w ajfn _i = m — 1 (Lemma 6.1(c)), we see 
from (6.6) that it is sufficient to show o? a+ i jm _i > m — 1. Since 6 m -i G Jm-i, 
Proposition 4.3 implies o?/?+i, m — l = ^/3+i,m-i > m — 1. We proceed by reverse 
induction. Suppose we have dh+i jm -i > m — 1, for some a<h-\-l<(3-\-l. Then 

G^, m _i = min{^ )m _i - l,p^ + i im _i}. 

Since Wh, m -i ~ 1 > m — 1 (Lemma 6.1(b)), we have dh, m -i > m — 1. Thus, by 
induction 

dh,m-i > m — 1 for a < h < (3 } (6-7) 
and in particular, o? a+ i jm _i > m — 1. 

For Part (d) fix 1 < j < m — 2. For 0</i<aor/?</*<s-l we have 

Sh,j = pdh+i, 3 - dfij = pwh+1,3 - Wh, 3 = 0h, 3 - 

For Part (e) we must show < 8h,j < max{^/ lJ+ i — (p— 1), 0} for a < h < (3 and 
1 < j < m — 2. For these j's for and a < h < (3 we have 

<^/ij = pdh+1,3 — dh }J 

= max{p4 + i,j - 4j+i + 1,0}. (6.8) 

If h is strictly less than /3 then (4,+ij < (4,+ij+i — 1 by definition. Thus 

< <^/ij < max {^ J+1 - (p - 1), 0} for a < h < [3. 

We deal with the indices a and (3 separately starting with (3. Note that 
E(w n+ i - w n ) = n+ i -0 n = Y(O n+ i). Since Y(O n ) G J\ for 1 < n < m - 1, 
Proposition 4.3 implies 

Wh, n +i — ^h,n > 1 for all /i and 1 < n < m — 2. (6-9) 

By definition dp + i jfl = wp + i jfl for each n. Thus 

< d p+lij+1 - 1. (6.10) 

Combining (6.8) and (6.10) gives 

< 5p tj < max{^j + i - (p - 1), 0}. 
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Now for the index a. By Lemma 6.1 we have w a , m _\ = m — 1. Thus (6.9) implies 
w a ^n = n for 1 < n < m — 1. By definition o? ajn = for 1 < n < m — 1. Thus 
for j < m — 2 we have 

8 a ,j = pda+i, 3 - ij (6-H) 
< p(o?„+i J+ i - 1) - (d a ,j +1 - 1) 
= 8 aij+1 -(p-l). 

We still must show 8 a j > 0. By (6.11) and since d a j = j, it is sufficient to 
show dhj > j for a < h < (3. We proceed by reverse induction on j and then on 
h: our basis step is (6.7). Suppose dhj+i > j + 1 for a < h < (3 and j + 1 < m — 1 
then 

dp tj = mm{dp tj+1 - l,pdp +ltj } > j 

since = > j (Proposition 4.3). Suppose for some a < h + 1 < (3 we 

have dh+i,j > j. Then 

d htj = min{4 J+ i - l,pd h+hj } > j. 

Thus dhj > j for all a < h < /3 and 1 < j < m — 2. ■ 

From the various parts of Lemma 6.2 we have < 8\ < . . . < $ m _i < 8 m . For 
< j < m — 1, since #j G </j we have iuqj = J . Since o? j = ^Oj, we have 

< 8 t < . . . < £ m -i < 8 m . 

For j < m — 1, since dj is an integer vector and ^ > 0, we have 8j G 3- Define 
bi := ^i, b 2 := $2 — • • • , b m := 8 m — 8 m -\. Then for j < m — 1, we have bj G 
3- Now define B := [bi, . . . , b m ]. Then the columns of B sum to T(iV). Lemma 
3.5 implies that V®(N) is not empty. Finally we can define Z := (Zi, . . . , Z m ) to 
be the r-monotonic element of V®(N). 

7. The composition O is not optimal 

We continue to assume s > 2 and that ^ is not empty. We use the notation of 
Sections 5 and 6. Thus ra, N, and O = (Oi,...,O m ) G V m (N) are chosen so 
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that O has the properties listed in Proposition 5.1. Further, Z £ V m (N) is as 
constructed in Section 6. For 1 < j < m set 

Zj : = Z\ + Z 2 + • • • + Zj, and 

6- : =o 1 + o 2 + --- + o 3 . 

Then we have r(Zj) = 8j and T(Oj) = 6j where the matrices [$ 8J ] = [8\, . . . , 8 m ] 
and [^-j] = [#!,..., # m ] are as in Lemma 6.2. In this section we complete the proof 
of Theorem 1.2 by showing that wt(Z) > wt(0). 
It is routine to verify that 

wt(Z) = mN — (Z\ + • • • + Z m -\) and 
wt(0) = mN-(d 1 + --- + O m - 1 ). 

Subtracting we get 

m — 1 

wt(z)-wt(0) = J2(dj-z J ). 

3 = 1 

Note that Z m _i = N — Z m and m -\ = N — m . By Proposition 5.1(c) there 
exists a positive integer k such that m = p k . Thus m -\ — Z m _i = Z m — p k and 
so we have 

(m-2 \ 
/ + J](Z J -0 J )J. (7.1) 

Since Z is r-monotonic, we have that Th(N) is the concatenation of the se- 
quences Th(Z\), . . . , Th(Z m ). For 1 < j < ra, since Zj = Z\ + • • • + Zj, we have 
that Th(Zj) is the concatenation of the first j sequences t/^Zi), . . . , T/^Zj). Note 
also that since r(Zj) = the length of Th(Zj) is ^j. For < /j < s — 1 and 
1 < z < Uh define 77^ to be the i th term of Th(N). Then, for example, 

r h (Zj) = (T h)1 ,T h)2 , • • • , T h)Sh j ) and 

T/i(Z m ) = (T ht S hjn _ 1 +l, T h) S h m _ 1 +2, ■ ■ ■ , Th,u h ) 

Define Th,o '■= for each h. This is done so that facts such as 

s—1 &h,j s-1 8h,j 

= J2Y1 n > i and ®i = n > i ( 7 - 2 ) 

h=0 i=0 h=0 i=0 
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may be unambiguously expressed even when some of the 8h,j or 6h,j happen to be 



zero. 



Our next result gives an upper bound on part of expression (7.1). 
Lemma 7.1. We have 

m-2 p-l 
j=l h=a 

where a, and (3 are as in Section 6. 

Proof. Combining the expressions of (7.2) we have for all 1 < j < m — 2, 

S—l I $h,j 8h,j 

^3 - o 3 = Y [Y Th >* ~ Y Th < 

h=0 \ 8=0 i'=0 

By Lemma 6.2(d) we have 

%3 -dj = Y[Y Th >* ~ Y Th - 

h=a \ 8=0 i'=0 

P S h, 3 

< YY t ^- ( 7 - 3 ) 

h=a «=0 

Recall that for each < h < s — 1 we have 77^ = p h+ns for some integer n > 0. 
If T K&Kj = p h+t % then 

8 h,J 

Y ^ < p h+ts+1 

8 = 

since Th,s hj is the largest summand and any particular power of p can appear 
at most p — 1 times in the sequence Th(N). By Lemma 6.2(e), if 8h,j > then 
8h,j+i — 8h,j > p — 1. This implies there are at least p — 1 terms x of Th(N) such 
that Th t s h j < x < Th t s hj+1 . Hence, we must have Th t s hj+1 > ph+ts+s ^ pj-^g r or an y 
< h < 5 — 1 and 1 < j ' < m — 2 we have 



< (7-4) 



8 = 
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Combining 7.3 and 7.4 we deduce 



m — 2 (3 m — 2 

^2 — s-i ' n '° h .j+ 1 

j = l h=a j = l 



13 m-1 

T h,$h,j 



^ fi m-1 

fjS—l ^ y ^ y 
F /i=a j = 2 

By Lemma 6.2(e), the nonzero summands among 2 , . . . } Th,8 hm _ 2 are au dis- 
tinct powers of p. Thus the sum of these terms is < Th,s h m _i (with equality if and 
only if T htShm _ 1 = ). This implies 



m-2 2 P P 

j = l h=a h=a 



since we are assuming s > 2 and p > 2. 

By Lemma 6.2(a), we have <^? )m _i = u^. Thus r^ jl 5 /3m _ 1 is the largest term in 
Tp(N). We have by Proposition 5.1(c) that m = p k . Recall that k = (3 (mods). 
Since is r-monotonic (Lemma 3.1) we have Tp } s l3m _ 1 = p k ■ B 

Combining Lemma 7.1 with (7.1) we deduce 



wt(Z) - wt(O) > Z m - I 2p k + > ' r,,.^ ) . (7.5) 



\ h=a / 

Now we give some lower bounds on Z m . 
Lemma 7.2. deg p (Z m ) > k. 

Proof. Recall that (G\, . . . , G m ) is the greedy element of V m (N). It is sufficient 
to show deg p (Z m ) > deg p (G m ) since deg p (G m ) > k (Proposition 5.1(b)). Define 7 
by 7 = deg p (G m ) (mods) , < 7 < s — 1. Then u 1 > where u = [u , . . . , u s _i]* = 
T(A^). By Lemma 6.1(a) we have a < 7 < (3. Since u 7 > 0, Parts (b) and (c) of 
Lemma 6.2 imply that $ 7jm _i is strictly less than it 7 . Recall that 

r(Z m ) = b m = [6o,m 5 • • • 5 &s-l,m] = u — 8 m -i. 
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Thus b~ f , m > 0. Since b~ f , m is the length of r 7 (Z m ) and since Z is r-monotonic, 
the sequence r 7 (Z m ) contains the largest b i;m terms in r 7 (iV). Since p de s P ( G m) \ s a 
term in T 7 (iV), we have deg p (Z m ) > deg p (G m ). ■ 



Lemma 7.3. Jf X]f=a r M/i m-i JS positive then 

/p-i 

deg„(Z m ) > deg„ Vt m , , I + 1. 




Proof. Suppose Yl= a T h,s h , m -i is positive and let r = deg p (j2h=l T h,s h , m -i 
Since the non-zero terms Th,s h m _i are all distinct powers of p there is no carryover 
of p-adic digits in their sum. Thus T Vt s m _ 1 = p r for some a < r/ < (3 and we must 
have 8 Vtm -i > 0. Lemma 6.2(b) and (c) imply 

br/,m ^rj — 1 ^ P 1 

where [iti, . . . , u m ]* = r(iV). Thus there are at least p — 1 terms in the sequence 
T v (Z m ) (since its length is b Vjfn ). Since Z is r-monotonic, all of these terms are 
greater than or equal to p r . Since there can be a maximum of p — 1 occurrences 
of the same term appearing in all of t v (N) } the largest term in T v (Z m ) must be 
> p r+s . The result follows since we are assuming s > 2. ■ 

Finally, we complete the proof of Theorem 1.2. 

Proposition 7.4. If s > 2 then ^ is empty. 

Proof. Set Q := p k + J2h=l r Mh,m-i- B y ( 7 - 5 ) ^ is sufficient to show Z m > Q+p k . 
Recall that for each h if Th,s h m _i 1S n °t zero then it equals p c for some c = h 
(mods). Since k = /3(mods) the p-digits of Q consist of zeros and ones. Thus, if 
p > 3 then 

deg p (Q + p k ) = max <^ k, deg p I ^ ^A.^-i 

Lemmas 7.2 and 7.3 imply that if p > 3 then Z m > Q + So we assume p = 2. 
We see that 

deg p (Q + p k ) < max < A; + 1, deg p I r fc , 5him _ 1 ] + 1 

I \h=a 
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By Lemma 7.3 we may assume deg p (Q -\-p k ) = k + 1 = deg p (Z m ). Recall that Z is 
r-monotonic and that T(Z m ) = u — $ m _i where u = [iti, . . . , u m ]* = r(iV). Since 
k + 1 = deg p (Z m ) we must have up + i — <£/?+i )m _i > 0. But then Lemma 6.2(a) 
implies (3 = s — 1 and k + 1 = a = 0(mods). Since the length of the sequence 
Th(Z m ) is Uh — 8h,m-ii Lemma 6.2(b) implies that if a < h < (3 and Uh > then 
Th(Z m ) has at least one term t/,,^. Lemma 6.2(c) implies that T a (Z m ) has at least 
two terms T 0}Ua and r aj „ a _i. Note that T 0}Ua = p k+1 since A; + 1 = deg p (Z m ). To 
sum up the last three sentences, we have 

13-1 
h=a + l 

Subtracting Q -\- p k we get 

13-1 

Z m — (Q + p k ) > r„ iUa _i — T a ^s a>m _ 1 + ^ ( r /i,« fe _ r Mfc,m-i) • 

/i=a + l 

By Lemma 6.2(c), we have u„ - £„, m -i > 2. Thus T„ jUa _i - r„ ilSc( m-1 > since, 
as p = 2, any power of p can appear as a r 8J - at most once. Clearly we have 
YfhZl+i T h,u h - ^A.m-i > °- Tll us Z m > Q + p k as desired. ■ 
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